[RVV] add rvv f32 kernel for ppmm#9962
Conversation
| src/x32-transposec/gen/x32-transposec-8x8-rvv.c | ||
| src/x32-transposec/gen/x32-transposec-16x8-rvv.c | ||
| src/x32-transposec/gen/x32-transposec-32x8-rvv.c) | ||
| src/x32-transposec/gen/x32-transposec-32x8-rvv.c |
There was a problem hiding this comment.
In all 3 PRs, there will be merge conflicts because these are all added in the same place. Keeping these lists alphabetized will avoid this.
There was a problem hiding this comment.
Thank you for your review, I have made the changes.
|
Hi, @dsharlet |
|
ping @dsharlet |
|
Hi, @dsharlet @fbarchard |
|
@velonica0 can you please merge the master branch to re-trigger the tools to run? Same for any other outstanding branches that should be merged. |
Add RVV kernel for f32-ppmm with MR={1,2,4} and NR={m1,m2,m4} variants.
Tested on SpacemiT K1 CPU and K3 CPU, both VLEN=256.
Move f32-ppmm entries from end of PROD/NON_PROD lists to alphabetical position (between f32-maxpool and f32-qs8-vcvt) to avoid merge conflicts with other RVV PRs.
Hi, @dsharlet |
Part of #9954
Add RVV kernel for f32-ppmm with MR={1,2,4} and NR={m1,m2,m4} variants. Tested on SpacemiT K1 CPU and K3 CPU, both VLEN=256.